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Abstract. A geodesic in a graph G is a shortest path between two vertices of G. 
For a specific function e(n) of n, we define an almost geodesic cycle C in G to be a 
cycle in which for every two vertices u and v in C, the distance dc(u,v) is at least 
dc{u,v) — e(n). Let uj(n) be any function tending to infinity with n. We consider 
a random d-regular graph on n vertices. We show that almost all pairs of vertices 
belong to an almost geodesic cycle C with e(n) = log d _ 1 log d _ 1 n + co(n) and |C| = 
21og d _ 1 n + 0(u)(n)). Along the way, we obtain results on near-geodesic paths. We 
also give the limiting distribution of the number of geodesies between two random 
vertices in this random graph. 



1. Introduction 

A geodesic in a graph G is a shortest path between two vertices of G. Let uj(n) be any 
function tending to infinity with n, and put e(n) = log d _ 1 log^ n + u(n). We define 
an almost geodesic cycle C in G to be a cycle in which for every two vertices u and 
v in C, the distance da(u,v) is at least dc(u,v) — e(n). We investigate the existence 
of almost geodesic cycles through random pairs of vertices in a random graph, and 
related questions on geodesies and paths that are nearly geodesic, in a sense to be 
made precise. Our results refer to the probability space of random d-regular graphs 
with uniform probability distribution. This space is denoted Q n ^ and asymptotics 
(such as "asymptotically almost surely", which we abbreviate to a.a.s.) are for n — > oo 
with d > 3 fixed, and n even if d is odd. 

Some related previous research focussed on finding (edge/internally)-disjoint paths 
with many sources and targets. Frieze and Zhao [3] showed that for sufficiently large d 
there exist fixed positive constants a and (3 such that a graph G taken from Gn,d a.a.s. 
has the following property: for any choice of k pairs {(a*, hi) \ i = 1, . . . , k}, satisfying 

(i) k < \adn/ \og d n] , and 

(ii) for each vertex v: \i : Oj = v\ + \i : hi = v \ < (3d, 

there exist edge-disjoint paths in 67 connecting ai to hi for all % = 1, 2, . . . , k. This result 
is optimal up to constant factors. The paths returned by their algorithm are of length 
of at least 10 log d n. 

Our focus is different as it comes from different motivation: studying almost geodesic 
cycles in Q n ^- Our result on internally disjoint paths refers to one pair of vertices fixed 
before the graph is chosen. This is a much weaker model than the model of [3], that dealt 
with 0(n/logn) pairs given by an adversary after the graph is chosen. However, we 
show the existence of disjoint paths that approximate the optimal path (whose length 
is a.a.s. in [log^n — Lo(n), log d _ 1 n + uj{n)]) by an additive factor of log d _ 1 log d _ 1 n, 
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whereas the result of [3] give at best a constant multiplicative factor. Additionally, our 
result holds for all d > 3, and that we find the maximum possible number of internally 
disjoint paths, d, that there can possibly be between two vertices. 

Theorem 1.1. Take any integer d > 3 and any function uj{n) with u{n) — > oo. Let 
G G Q n ,d and choose vertices u and v in V(G) independently with uniform probability. 
Then a.a.s. the following hold: 

(i) \d(u, v) - log^ n| < uj{n), 

(ii) there are d paths connecting u and v such that the subgraph induced by each pair 
of these paths is an almost geodesic cycle. 

Note that the d paths in (ii) theorem are internally disjoint because each pair of them 
induces a cycle. 

In a slightly different direction, we also investigate the distribution of the number of 
geodesies joining two vertices (see Theorem 12.61) . 

We may obtain the lower bound in part (i) of the theorem from an elementary 
observation. Note that, given G G Q n ,d, the number of vertices at distance at most i 
from a vertex u is bounded above by 



independently with uniform probability, then the probability that the second vertex is 
at distance at most i = log d _ 1 n — u)(n) from the first is at most 



if u, v are vertices chosen independently with uniform probability in G G Q n ,d and u(n) 
is a function satisfying u{n) oo. The fact that a.a.s. d(u, v) < log^ n + u{n) will 
follow from our study of the distribution of the number of geodesies in G G Q n ,d- 

The rest of the proof requires more sophisticated arguments. Instead of working 
directly in the uniform probability space of random regular graphs on n vertices Q n ,d, 
we use the pairing model of random regular graphs, first introduced by Bollobas j2], 
which is described next. Suppose that dn is even, as in the case of random regular 
graphs, and consider dn points partitioned into n labelled cells v\, . . . , v n of d points 
each. A pairing of these points is a perfect matching of them into dn/2 pairs. Given 
a pairing P, we may construct a multigraph G(P), with loops allowed, as follows: the 
vertices are the cells vi,. .., v n , and a pair {x,y} in P corresponds to an edge ViVj 
in G(P) if x and y are contained in the cells Vi and Vj, respectively. It is an easy 
fact that the probability of a random pairing corresponding to a given simple graph G 
is independent of the graph, hence the restriction of the probability space of random 
pairings to simple graphs is precisely Q n ,d- Moreover, it is well known that a random 
pairing generates a simple graph with probability asymptotic to a constant depending 
on d, so that any event holding a.a.s. over a probability space of random pairings also 
holds a.a.s. over the corresponding space Q n> d- For this reason, asymptotic results over 
random pairings suffice for our purposes. The advantage of using this model is that the 





and therefore, a.a.s. 
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pairs may be chosen sequentially so that the next pair is chosen uniformly at random 
over the remaining (unchosen) points. For more information on this model, see [5]. 

The numbers of geodesies are investigated in Section |21 Theorem 11.11 is proved in 
Section El Some final remarks are in Section 0] 



2. Distribution of the number of geodesics 

The first portion of our argument is a simplified version of part of the argument of 
Bollobas and Fernandez de la Vega [3]. We consider the process in which the neigh- 
bourhoods of u and v are exposed step by step. First, the neighbours of u and v are 
revealed, then the vertices at distance two, and so on. This sequential exposure of the 
random regular graph is analysed using the random pairing model mentioned in the 
Introduction. 

Let Ni(u) denote the set of vertices at distance at most i from u. Note that, in the 
early stages of this process, the graphs grown from u and v tend to be trees, hence the 
number rii of elements in Ni(u) is approximately 

m-! + (d- l)(ri;_i - n^ 2 ) ■ 

Let fi denote the number of vertices in a balanced <i-regular tree, that is, 



fi = 1 + d j2(d-iy = i + 

j=0 



d((d-l] 



d 



and let 



'o 



2 lo grf-i n 



Lemma 2.1. Let uj(n) be any function of n such that uj{n) — > oo. For i < iq — uj{n) 
a.a.s. the cardinality Ui of Ni(u) equals fi. Moreover, fori < zq + u)(n) a.a.s. 



Hi = fi -0(u(n)(d - l)3(i-io)+u>(n)\ 



Proof. First note that it is sufficient to consider the case when uj{n) = o(logn). 

Since fi denotes the number of vertices in a balanced tree where every non-leaf 
vertex has degree d, the first assertion follows if we show that a.a.s. the set of vertices at 
distance at most i < io — uj(n) of a vertex u induces a tree. In other words, if we expose, 
step by step, the vertices at distance 1,2, ... ,i from u, we have to avoid, at step j, edges 
that induce cycles. So, we wish not to find edges between any two vertices at distance 
j from u or edges that join any two vertices at distance j to a same vertex at distance 
j + 1 from u. We shall refer to edges of this form as 'bad edges'. Note that the expected 
number of 'bad edges' at step i + 1 is equal to 0{nf/n) = 0(ff jri) = 0((d — 1) 2% /n). 

Consider i 1 = [| log d _! n — oj{n)\ . The expected number of 'bad edges' found up to 
step i\ is equal to 

u-i 

J20((d-l) 2 i/n) =0((d-l) 2h /n) = 0((d - l)" 2 ^)) = (1) . 

3=0 

Thus, by Markov's inequality, a.a.s. there are no 'bad edges' until step i\, hence a.a.s. 
N^iu) is a tree and rii = f\ for i <i\. 
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In order to prove the second assertion, notice that the expected number of 'bad edges' 
added between step i\ + 1 and step i, i < [i + uj{n)\ = |_| ^og d _ l n + oj{n)\ is equal to 



i-l 



J20{(d-l) 2j /n) = 0((d-l) 2l /n) = 0((d - l) 2 ^) . 

j=h 

Thus, again by Markov's inequality, a.a.s. the total number of 'bad edges' at time i is 
at most 0(u(n)(d - l) 2 ^)). Notice that one 'bad edge' added in this time interval 
can destroy a tree branch of size 0((d-iy- io+ "W). This occurs because the 'bad edge' 
creates a cycle instead of exposing a new vertex v. The branch of descendants of v, 
which would appear had v been exposed and had the process continued as a balanced d- 
regular tree, is therefore destroyed and has size at most l + (d—l) + - ■ -+(d— i)*-*o+w(n) _ 

o((d- iy-i°+"Wy 

Thus, we have a.a.s. 

m = fi- 0(u{n){d - l) 2 ^)) -0({d- i)*-*o+"(«)). 
This completes the proof of the lemma. □ 

Immediately from this lemma, we have 
Corollary 2.2. For i = iq + o(logn) a.a.s. 

ni = fi- n°« = nV 2+ °W . 

In the remainder of this notes, let u, v be vertices chosen independently with uniform 
probability in a graph G £ Q n ^ and consider the process of exposing the neighbourhoods 
of u and v introduced in Lemma 12.11 We say that iVj(u) and Ni(v) join at time i if 
n N^v) = and N^u) n Nt(v) ^ 0. 

Also, whenever a result that holds for any uj{n) satisfying \\m n ^ 00 uj{n) = oo is 
proven, we shall assume without loss of generality that u(n) = o(logn). 

Lemma 2.3. Let k be a fixed integer and define j(n, d) — ^ \og d _ 1 n — i , the fractional 
part of \ log^ n. Then 

d{d- i)2*-27(M)' 



F(N io+k {u) n N io+k {v) = 0) ~ exp 



d-2 



Proof. Denote Si the event that the neighbourhoods of u and v are separate at time i, 
that is, Nj(u) and Nj(v) did not join up to time i. We claim that 

F(S i0+k | S i0+k ^) ~ exp (-d 2 (d- l)2*-27(n,d)-2) 

This implies our result for the following reasons. If M is a positive integer, — M < k, 

k 

l=-M+l 

Furthermore, equation {IJ establishes that a.a.s. v) > 2io — uj{n) for any function 
ui(ri) with hm n _ >0O a;(n) = oo. In particular, given e > 0, we can choose M = M t > 
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Figure 1 . First case - odd length. 




Figure 2. Second case - even length. 

sufficiently large so that P(S , j _m) > 1 — e. Given such an M, we use the previous 
equation to derive 

k 

¥(S t0+k ) > (1 - 6) H exp (-d 2 (d - if -27(M)-2) (1 _ o(1)) 



l=-M+l 



d 2 {d-l) 21 



= (1 — e) exp 
The same calculations also lead us to 

k 



:i-e)ex P Yl - (d _i)2 + 2 7M ) 

d((d - l) 2k+2M - 1) 



(d- l)2A/+2 7 (n,d)( d _ 2 ) 



rf((d-i) 2fe+2M - 1) 



exp 



l)2M+2 7 (M)(d-2) 



Putting the last two equations together and letting e — >• 0, during which we may assume 
M e — * oo, we have 

nSi +k) ~ exp I ^— - — 

as required. 

We now focus on proving the claim. First we would like to find the expected number 
of joins at time i = io + k given that Ni-i(u)nNi-i(v) = 0. Let t/j_i = Ni^i(u)\Ni^ 2 {u) 
and Vi-i = A r j„i(f) \ N^iv). These are the sets of vertices at distance % — 1 from u 
and respectively. We have to consider two types of join at time %. The first type (see 
Figure consists of edges that join one vertex in L^_i to a vertex in V^_i and therefore 
create ww-paths of odd length. The second type (see Figure EI) contains joins such that 
a vertex in V(G) \ (iVj_i(u) U iVj__i(f)) has neighbours in each of f/j_i and Vi-\. This 
generates a path of even length between u and v. 

Consider joins of the first type. Recall that we are considering the process of exposing 
the neighbourhoods of u and v step by step. After the first % — \ steps, we have exposed 
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the sets A r i _ 1 (u) and Ni_i(v), which are assumed to be disjoint. Recall that, according 
to the pairing model (see Introduction), any vertex in U^i and Vi-\ can be regarded 
as a cell of distinct points, where the number of points corresponds to the number of 
unexposed neighbours of this vertex. The probability that one given point joins another 
is then asymptotic to l/(dn), since any pair of unmatched points is equally likely to be 
paired and the whole process has, by Corollary 12. 2\ matched at most n 



l/2+o(l) 



on 



pairs of points to this moment. Asymptotically, there are |{7j_i||Vi_i|(d — 1) pairs of 
points such that one is associated with a vertex in £/j_i and the other with a vertex in 
Vi-\. This is because the hypothesis % = i$ + k implies, by Lemma \l. 11 that the number 
of vertices in Ui-\ or Vi-\ incident with 'bad edges' created at step -z — 2 is a.a.s. at 
most O (uo(n)) for any uj{n) — > oo, and it is clear that each vertex in U\-\ or V^_i with 
degree larger than 1 in G[Ni-i(u)] U G[Ni_i(v)] is incident with a 'bad edge'. 

Thus, the expected number of edges of the first type joining the neighbourhoods of 
u and v at time i — 1, that is, the number of pairs of points consisting of one point 
associated with a vertex in [Zj_i and one point associated with a vertex of Vi-\ exposed 
at time i is asymptotic to 

-, lA-l • 

an 

A similar argument shows that the expected number of edges of the second type is 
asymptotic to 

-7T - 5 n \\ v i-l\ — 1 W-l • 

d l n A dn 

Summing these, 

E(number of joins at time i = i + k | Si_i) 

(d- If d 2 (d-l) 2l ~ 2 d 2 (d-l) 2k 

n l^- 1 "^- 1 ! n ~ (d - l)2+27(n,d) ' 

We wish to apply the method of moments to establish 



F(S l0+k \ Sio+fc-i) ~ exp {-d 2 {d - l)»-27(n,d)-2) , 

so we have to verify that the j-th factorial moment of the random variable Z counting 
the number of joins at time i = i + k satisfies E([Z]y) = \{ 3 , for all j > 2. 

Let j > 2 and suppose that the subgraphs induced by Aj_ 1 (u) and Ni_i(v) are 
disjoint. As before, let £/j_i = A r i _ 1 (n) \ iVj_ 2 (w), Vf_i = A r i _ 1 (^) \ Ni_ 2 (v), and, given 
r G £/j_i, s G t G V (G)\(Ni_i(v) U Ni^u)) , introduce indicator random variables 
X( rjS ) for the event that rs is an edge in G and V( r ,s,t) f° r the event that rt and st are 
both edges in G. So, 

r G C/j_i, s G Vi_i r G L^_i, s G V^_i 

te V r (G)\(iV J „ 1 (i;)UiV l _ 1 (n)) 



is the random variable counting the number of joins that appear between the neigh- 
bourhoods of u and v at step i. 
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The j-th factorial moment of Z is given by 

3 

E([Zy = ^^P((X (Wm) = 1, 1 < m < I) A (Y(r m ,s m ,t m ) = 1, i + 1 < m < j)), (2) 
;=o * 

where, for any given I, denotes the sum over all distinct ordered pairs (r m , s m ), 
1 < m < I, and (r m , s m , t m ), / + 1 < m < j. 

We shall prove later that the relevant terms in this sum are the ones for which all the 
ordered pairs are disjoint, that is, there is no repetition of vertices among the j events. 
Assuming this, we obtain 



(d-lf \i( (d-lf y-i 

x 



dn — o(n)J \dn 2 — o(n 2 ) 

This is because there are ( ( j 1 ') ( n ^°j ) ways of choosing j vertices in each of Ui 
and Vi, and of choosing j — I vertices in V(G) \ (iYj_i(w) U Ni_i(v )). Moreover, pairing / 
of the chosen vertices in [/, with I of the chosen vertices in Vi can be done in (]) l\ ways, 
whereas there are [(j — /)!] 2 ways of creating triples on the remaining chosen vertices 
in Ui, Vi and the vertices chosen in V{G) \ (A r i _ 1 (n) U A r j_ 1 (w)). Now that we fixed 
distinct ordered pairs (r m , s m ), 1 < m < I, and (r m , s m ,t m ), I + 1 < m < j, the term 



( dn-o(n) ) ( dn 2 -o(n 2 ) ) 3 corresponds to the probability that all the events X, 
and Y(rm,s m ,t m ) — 1 occur simultaneously, since there is only a finite number of them. 
The previous sum is asymptotic to 



E 



1 







|£/i-i| j P 





j\ j\ (j — /)! l\ \ dn J \ dm? 



1=0 

j 



nidPjl ^ HJ- 1 Y- 



1=0 v 7 



lU^m^Hd-lfi If d\d-lf k ^ 



(EZ) 



n jj\ j\\ ' j\ \(d- 1) 2 +27(M) 

It remains to show that indeed the sum over all disjoint ordered pairs (r m , s m ), 
1 < m < I, and (r m , s m ,t m ), I + 1 < m < j, is asymptotic to the sum over all distinct 
ordered pairs. Suppose that there are j — a distinct elements appearing in the first 
coordinate, j — b in the second and j — I — c in the third, where a + b + c > 1. The 
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terms of this form in equation (J2J) are bounded above by 

y-y y-v fj ; ~ l\ f\Ui-i\\ f j - 1\ f\Vi-i\\ (j-l-\\(n - o(n) 



1=0 ** 



a J \ j — a ) \ b J \ j — b J \ c J \j — I — c 



I J \dn — o(n)J \dn 2 — o{n 2 ). 

where denotes the sum over all triples (a,b,c) G {0, . . . , j — l} 2 x {0, . . . , j — I — 1} 
satisfying a + b + c > 1. This is because there are ( -!T„ ) ways of choosing j 



vertices in and ways of building a multi-set of cardinality j with j — a given 
elements (and using all of them). The same is true for choosing vertices in V^_i and 
V(G) \ (Ni_i(u) U N i _ 1 (v)). Our last expression is smaller or equal to 

^^{j- - 1 - iy p.^j-a iv.^j-b n j-i-c 



a\b\c\ (j-a)\ (j-b)\ (j-l-c)\ 



j! 2 f (d-l) 2 yf (d-l) 3 y-i 

X 



l\ \dn — o(n)J \dn 2 — o(n 2 
If we divide this by l^-iHK-iPCrf-i) 3 ^ ^ s j s aS y m ptotic (with respect to n) to 



1=0 ** 



brnC ' 







l^-llll 





c — >• oo as n — > oo for 

every a + b + c > 1, we conclude that the above sum tends to zero as n tends to infinity 
and therefore the terms related to non-disjoint tuples in equation (J2J) can indeed be 
ignored to compute E([Z]j). 

Given this, the method of moments implies that 
P(no joins at time i = i + k \ N^v) n N^x(u) = 0) ~ exp (-d 2 (d - i) 2 *=- 2 7(",d)-2) ? 
which completes the proof of the claim and therefore establishes the lemma. □ 

We are now prepared to prove the result mentioned at the end of the last section. 

Corollary 2.4. Let u,v be vertices chosen independently with uniform probability in 
G G Gn,d- For any function u(n) such thatu(n) — > oo, the assertion d(u,v) < log d _ 1 n+ 
u(n) holds a.a.s. 

Proof. Let e > 0. Lemma \2. 31 implies that the probability of the event Ek that u and v 
are at distance greater than log,^ n + k is asymptotic to 

exp(-d 2 (d-l) 2fc - 2 ^- 2 ), 

where k a fixed integer and 7(71, d) is the fractional part of |log d _ 1 n. So, P(£fe) < e 
for k sufficiently large, and our result follows. □ 

Lemma 2.5. Let k be an integer and let j(n,d) be the fractional part of \ \og d _ 1 n. 
Define Oi to be the random variable counting the number ofuv-paths of odd length, that 
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is, paths of the first case, created at step i. Let Ei be the equivalent random variable for 
paths of even length. Then 

(i) With fl k = d(d - l)2fc"27(M)-2 ; 

F(O io+k = j | iV io+fc _i(u) n N i0+k ^(v) = 0) ~ ^ exp(-/i fc ). 

(ii) With V k = d{d - l)2*-27(n,<i)-l ; 



V 3 

F(E i0+k = j I N i0+k ^(u) n N iQ+k ^(v) = A O i0+k = 0) ~ ^ exp(-z/ fe ). 

Proof. This can be proven by the method of moments using calculations very similar 
to the ones in the previous lemma, proceeding separately for joins of the first type 
and joins of the second type. For the former, we condition on the event that no joins 
occurred in previous steps of the process, and, for the latter, we further assume that 
no joins of the first type occurred in the current step. The details are omitted. □ 



We observe that, alternatively, the proofs of the previous lemma and of Lemma 12.31 
could be unified by considering joint factorial moments of random variables for joins of 
the first type and of the second type. 

We are now ready to deduce one of the main results. 

Theorem 2.6. Fix an integer I > 1. The probability that two vertices u,v chosen 
independently with uniform probability in G G Q n ,d are joined by exactly I distinct 
geodesies is asymptotic to 

» (^-i)2^2 7( M)- 2 y / d ^ d _ 1)2fc - 27( M)-i 

^ l\ 6XP V ~d~~2 

k=— oo 

X (1 + (d - l) 1 exp(-d(d - i)2fc-2 7 (M)-i)) _ 

Proof. Let Z\ be the event that u and v are joined by exactly / geodesies, and let Jj 
denote the event that the first join occurs at time i. Then, given a positive integer M, 

io-M M-l 

nzd = nzi a u j k ) + Yl A J *o+ fc ) + A U Jfc )- 

k=l k=-M+l k>i +M 

The first and last element in the right-hand side can be made less than e, for any 
given e > 0, by choosing M = M e sufficiently large, as ensured by Corollary 12.41 and 
by the fact that equation (JTJ) holds a.a.s. Also, each of the terms P(Z/ A Ji +k), for 
—M + 1 < k < M - 1, is equal to 

¥(N io+k ^(u) n N i0+k ^(v) = 0) [F(O io+k = I | N i0+k ^(u) n N io+k ^(v) = 0) + 
+ F(O l0+k = | N io+k ^(u) n N io+k ^(v) = 0) x 
x F(E i0+k = I | O i0+k = A N io+k ^(u) n N i0+k ^{v) = 0)] . 
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By our previous lemmas, this is asymptotic to 

d(d- l)2fc-2 7 (n,d)-2\ / ( d ( d _ 1 )2fc-2 7 (n,d)-2y 



exp 



2! 



exp 



-d(d - i) 2fe - 2 7(M)-2) + 



+ 11 J L exp(-d(d - if*-*rM-2 _ d ( d _ 1 )2fe-2 7 (n,d)- 1 > 



/! 



Hence, if we let e tend to zero, 



P(3 



~ (d(d-l) 2k - 2 ^- 2 Y 
n ex P 



fc= — oo 
X 



d(d - l)2fc~2 7 (M)-l 



d-2 



(1 + (d - 1)' 6X P (-d(d - l)2fc-27(n,d)-l)) ) 



as required. 



□ 



An interesting special case is when / = 1, since this theorem provides the probability 
of u and v being joined by a unique geodesic. This probability is given by 



d(d 



exp 



k=— oo 



d(d - l)2fe-2T(n,d)-l 

d-2 



x (1 + (d - 1) exp(-d(d - i)2*-27(»,«0-i)) . 

The probability here is a function of 7(71, d) and oscillates as 7(71, d) varies from to 1. 

We include some numerical results in the table below for some values of d, where prob 
is the probability of a unique geodesic as 7(71, d) = and osc is the maximum variation 
with respect to 7 = as 7 varies from to 1. 



d 


prob 


osc 


3 


0.7213 


8.6 x 10~ s 


4 


0.6073 


1.4 x 10~ 3 


5 


0.5444 


7.9 x 10~ 3 


10 


0.4411 


7.6 x 10~ 2 


100 


0.3743 


0.3 



The magnitude of the oscillations depends on d. We justify why it is small when d 
is small. Note that the probability of a unique geodesic is equal to 



d-2 
d^T 



S(d-i)* ~l(n, d) + log/^i 



d 



] (d-l)(d-2) 



+ 



d-2 



d- 



j£(d-i) a ( -7(", rf ) + !og(d-i)2 



+ 
d 



d-2 



(3) 



where S c (x) = 

m=— 00 

summation formula gives us that 

00 

S c (x)= 



c m+x exp(— c m+x ), a function with period 1. The classical Poisson 



exp(— c t+x ) exp(27rimt) dt. 
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Setting z = c +x gives 



logc 



m=— oo 



S c (x) = — 2 exp(— 2mmx) / exp(— z + 27rzm log zj logc) ok, (4) 



and the integral is just Y{2nim/ log c + 1). 

By properties of the gamma function (see for instance pQ), we have 



7T 

|r(l + yi)| = \iyT(yi)\ = \y 



y smh(iry) 



so given m in the previous summation, 

|r(27rmi/ logc + 1) exp(— 27rimx)\ = |r(27rim/logc + 1)| 

V2 (5) 



2tt 2 


m 




log c 


sinh(2 


7T 2 m/ log c) 



The term for m = in the sum (j3J) is independent of x, hence it yields terms 
independent of n in equation (jSJ). In the special case d — 3, equation flSJ) leads to 
the following bounds on the other terms of the sum (J3J). For |m| = 1, the bound is 
approximately 4.32 x 10~ 3 , for \m\ = 2, it is approximately 4.94 x 10~ 6 , and for larger 
\m\ the bounds are even smaller. Similar observations explain the small oscillations 
when d is small. 



3. Almost geodesic cycles 

In the proof of Theorem II. II we shall use the following auxiliary result. 

Lemma 3.1. Let G £ Q n ^ and let u,v be vertices chosen independently at random in 
G. Consider functions a(n), (3(n) such that a(n),/3(n) — > oo, a(n) = o(log d _ 1 n) and 
f3(n) = o(a(n)). Then a.a.s. every vertex at distance \_ct(n)\ from u or v lies on at 
most one uv-path with length less than or equal to log d _ 1 n + f3(n). 

Proof. We prove this result for vertices at distance L a ( n )J from u, and by a similar 
argument the same result holds for vertices at distance L a ( n )J from v. As in Section^ 
we consider the process of exposing the neighbourhoods of u and v based on the pairing 
model. Here, N(u), the neighbourhood of u, is exposed for [o^ 77 -)] steps while N(v), 
the neighbourhood of v, is exposed for log d _ 1 n — fi(n)\ steps. By Lemma l2~T| a.a.s. 
N(u) and N(v) are both trees. Moreover, Lemma 12.31 ensures that N(u) R N(v) = 
holds a.a.s. 

Let U a be the set of vertices at distance L a ( n )J from u. Given a vertex w £ U a , let 
Y w be the indicator random variable for the event that w is connected to N(v) by at 
least two distinct paths of length less than or equal to \ log,^ n — ct(n) + 2/3(n) + 2. 
Define Y = Y2weu a ^ ^ s c ^ eai that this lemma follows if we prove that a.a.s. Y — 0. 
We shall do this by using 

p(y>i) < J2 p ( F - = 1 ). 

w£U a 

and by showing that the right-hand side goes to zero as n tends to infinity. 

For a fixed w, define the set iV^ obtained by the exposure of the neighbourhood of w 
for i log d _ 1 n — a(n) + 2f3(n) + 2 steps. This time, however, the neighbour of w in N(u) 
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is not added to N' w at the first step of the process, that is, only the "new" neighbours 
of w are exposed. As in Lemma 12.11 we use the term "bad edges" for edges that yield 
cycles in N' w . Consider the random variable X w counting the number of "bad edges" 
in N' w . Then, calculations analogous to the ones in Lemma [2. II establish that 

L| logd-i n-a(n)+2/3(n)+2j / / , 



E(X W ) Yl 



s=0 

O /(r] ~ V 



(d-1) 



n 



7/7 _ 1 Vog d _! n-2a(n)+4/3(n) \ 

^— )=0((d- l) . 

Thus Markov's inequality implies 

F(X W > 1) = O ((d - . 

Now, note that 

¥(Y W = 1) = F(X W > 1)¥(Y W =1\X W > 1)+F(X W = 0)F(Y w = 1\X w = 0). 

We have a bound for the first term in this sum. For the second term, we use the 
definition of conditional probability and observe that the event (Y w = 1) A (X w = 0) 
occurs only if there is a pair of distinct paths joining w to N(v) with length at most 
\ \o<g d _ x n — a{n) + 2(3{n) + 2 and with the property that, after they first split, they do 
not join again. 

So, a bound on W(Y W = 1 A X w = 0) may be obtained by counting the number of 
possible pairs of distinct paths P and Q joining itj to N(v) with lengths r and s, where 
r < s < g log d _ 1 n — a{n) + 2/3 (n) + 2, and the first j vertices are shared by both paths, 
while the remainder of the paths are internally disjoint. So, if i$ = [| logd_i n\ , 

[i -a(n)+2(3(n)+2\ [i -s} r-1 

F(Y W = 1AX W = 0)= EE 

'(d-l)^°-^\f n-o(n) \ fr + s - j - 2\ fr + s - 2 N 
2 ;Vr + s-7-2/V ? A r-1 

x j!(r- l)!(s- 1)\0 




Lio-a(n)+2/3(n)+2j [io-sj r-1 
s=l s=l j=0 

0((d-l)W- to W). 



(d- l)2*o-2/3(")(d_l)»-+--J 



n. 2 



Note that the formula holds because there are at most ((d-i) L ™ J ) 

ways of choosing 

two vertices in N(v) and there are ( r ™~°^ 2 ) ways of choosing vertices in the graph 
to include in the two paths. Moreover, these vertices can be divided into vertices of 
P (~)Q, P — Q and vertices of Q — P in ( r+s ~ ,J ~ 2 ) ( r ^ s ^ L 2 ) ways and can then be ordered 
to form the paths in j\(r — l)!(s — 1)! ways. Finally, each edge on the path appears 
with probability at most conditional on the fact that all previous edges on the 

path have appeared. 
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We conclude that 

¥(Y W = 1) = F(X W > 1)F(Y W = 1 | X w > 1) + ¥(X W = 0)P(Y w = 1 | X w = 0) 
< P(X W > 1) + P(F W = 1 | X w = 0) 

- (1 {(A _ 1 \49(n)-2a(r»n , 17 U» X J j_ 

17 U a ^ ^ + l_0(( d _l)4/3(n)-2a(n)) 

= O ((d - . 

Now, because there are O f(d — l) a ^) vertices at distance [ Q! ( n )J °f u i we have 

p(y > i) < X] P ( F - = x ) 

= O ((d - l) a(n) ) O ((d - l)^(«)-2a(n) j = ((d - . 

Because = o(a(n)), this term goes to zero as n tends to infinity and indeed 

P(y > l) 0. The lemma follows. □ 

We are now ready to prove the main theorem. 

Proof of Theorem M . 11 Part (i) of the theorem follows from Corollary 12.41 and equation 
©• 

Before proving part (ii), it is convenient to deal first with the following simpler goal. 
To state this we need two definitions. A k -near- geodesic is a path that is a geodesic 
between the two vertices at distance k from its ends. A vertex p on a path P between 
vertices u and v is said to be a midpoint of P if \dp(u,p) — dp(v,p)\ < 1, where dp 
denotes the distance along path P. 

Lemma 3.2. Asymptotically almost surely, for every two distinct k-near- geodesies, P 
and Q, between u and v with midpoints p and q, respectively, 

log d _i(n) - u(n) < d(p,q) < log d _ 1 (7i) +u(n). 

Proof. For the upper bound, we know as in part (i) of the theorem that a.a.s. d(u, v) < 
log d _ 1 n + w(n), hence there is a sufficiently short path connecting p to q through u or 
v. 

We turn to the lower bound. Given a function u>(n) satisfying u(n) — >• oo, we know 
that a.a.s. d(u,v) > log d _ 1 n — u(n) (see (JTJ)). Consider distinct k-near geodesies P and 

Q- 

Claim 1: P and Q a.a.s. do not have a vertex in common at distance at least ^j^- 
from their endpoints. 

We prove the claim by contradiction. Suppose without loss of generality that such a 
vertex is closer to u on P and let w be the vertex on P at distance [^^^J from u. Note 
that P and Q a.a.s. differ at some vertex or edge after w, since the set of vertices at 
distance at most a.a.s. induces a tree. But then, w lies on at least two distinct u, v 
paths with length less than or equal to log^-L n + loga;(n), which a.a.s. does not occur 



by Lemma EIU with a{n) = and (3{n) = loga;(n). (Note that the lengths of both P 
and Q are a.a.s. bounded by log d _ 1 n + log u;(n) because any A;-near geodesic between 
u and v has length at most d(u, v) + 4k.) This proves the claim. 

Now consider the event that the midpoints p and q of P and Q are at distance 
at most log d _ l n — u>(n). One way for this to occur is by the existence of a pg-path 
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Figure 3. Path R 



R of length at most log d _ 1 n — u(ri) using vertices and edges on P U Q only. But 
d(u, v) > log d _ 1 n — u(n) implies that R does not contain vertices at distance less than 
or equal to from u or v. Claim 1 shows that no other vertex can be in common. 
Thus, a.a.s. there is no short path from p to q using edges on P and Q only. 

So consider a geodesic A between p and q containing at least one edge outside PUQ. 
Using A oriented from p to q as a reference, let vp denote the last vertex on An P and 
let vq be the first vertex on A H Q after vp. The vertices vp and Vq divide the geodesic 
into three parts, namely from p to Vp, from Vp to Vq and from vq to q. Because P, Q 
are /c-near-geodesics between u and v for a fixed k and A is a geodesic between p and 
q, we must have cIa(p, Up) = dp(p,vp) and ^(^q, g) = cIq(vq, q). So, for p and g to be 
at distance at most log d _ 1 n — Lo(n) for some o;(n) — > oo, it must be that 

d A {v P , vq) < hg dr . 1 n - d P (p, v P ) - d Q (v Q , q) - u(n). (6) 

So, a short path between p and q has to be caused by a short path connecting a vertex 
in P to a vertex in Q which is internally disjoint from P U Q. More precisely, there 
must exist vertices vp, Vq on P and Q, respectively, and an v P VQ-paih R satisfying: 

V(R)nV(PUQ) = {v P ,v Q }, (7) 
\R\ < log^i n - d(p, wp) - d(v Q , q) - u(n). (8) 

Such a configuration is illustrated by Figure 3. 

We prove that a.a.s. G does not contain a path R satisfying ((7j) and (jSJ)- We do this 
by exposing the neighbourhoods of vertices along P and Q conditional on the particular 
paths P and Q being in the graph. By relaxing the condition that P and Q are fc-near 
geodesies (but retaining the condition that their length is at most d(u, v) + Ak + cu(n)), 
we may take the rest of the pairing to be random. We will later argue that the number 
of pairs of such paths P and Q is small enough for our argument to work. 

Given a vertex p r at distance r from p along P and a vertex q s at distance s from 
q along Q, let X Pr ^ a be the event that p r and q s are connected by a path of length at 
most log d _ 1 n — r — s — u(n) which is internally disjoint from P and Q. Define the 

random variable Y P;Q = eI^ 1 "" 1 ES^ 1 "" 1 " - ^*. so that Y P>Q = only if G 
does not contain a path R satisfying (JJJ) and (JHJ) with respect to P and Q. 

Once again, we look at the process in which the neighbours of p r and q s are exposed, 
then their neighbours are exposed, and so on, but we do not consider the neighbours 
of p r and q s on P or Q, so as to expose the sets Ni(p r ) and Ni(q s ) containing only the 
vertices at distance at most i from p r and q s that can be reached by paths internally 
disjoint from P and Q. Clearly, p r and q s are joined by a path as in (J7J) and JSJ) only if 
Ni(p r ) and Ni(q s ) join in at most ^(log^ n — r — s — ou(n)) steps. The probability of 
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this can be calculated as in the earlier sections, and we conclude that 
HYp,Q > 1) < EQO 

r|( lo Sd-i n-u))] r|(log d _i n-u])-r 

< E E p (^*) 

r=0 s=0 
r|( lo Sd-i n--w)l r|(log d _i n-w)]-r 

= 4 ^ ^ o((d-l) 2 (5 (logd - in - w) - , - s )/n) 

r=0 s=0 

= o ((d-iy^) . 

By Lemma with a(n) any function tending to infinity sufficiently slowly, and (3(n) = 
o(a(n)), the number paths of length at most log d „ 1 n + j3(n) between u and v is a.a.s. 
at most 2{d — l) a(n ), since this is a bound on the number of vertices at distance L a ( n )J 
from u. Thus, a.a.s. there are at most 7(71) pairs of such paths between u and v, for 
any 7(71) — > 00. Let Z denote this asymptotically almost sure event (for some a to be 
restricted shortly), and let Z be its complement. 

Let Y = X)pq^p,Q' where the sum is over all pairs of distinct paths between u and 
v whose length is at most d(u, v) + 4k + u(n). Then, 

p(y > 1) = p((y > 1) n z) + p((y >i)nz) 
<p(z) + ^p((y PiQ >i)nz) 

<p(z) + o( 7 H(d-i)^ (n) ), 

which tends to provided 7(71) = o((d — l) 1 ^^). As a consequence, a.a.s. there are 
no configurations satisfying the conditions in (J7J) and (JSJ). Hence, a.a.s. the inequality 
g) > log d _ 1 n — u{n) holds, and we the lemma. □ 

We return to the proof of the theorem. We say that two points on a cycle are 
diametrically opposite if the distance between them around the cycle is (_^/2j , where 
the cycle has length £. Note that part (ii) of the theorem follows immediately from 
the following, since if there is a "short-cut" for any two vertices on a cycle, there is a 
short-cut for a pair of diametrically opposite ones. 

Claim 2: Asymptotically almost surely, there is a cycle C passing through u and v, 
of length I satisfying |2 log d _ 1 n — £\ < u)(n), such that for every pair of points p and q 
that are diametrically opposite on C , d(p, q) > log^^ n — log d _ 1 log d _ 1 n — u(n). 

We now prove Claim 2. Note that, by (i), for all e > 0, there is K sufficiently large 
that ¥(d(u,v) > log d _ 1 n — K) > 1 — e for all n sufficiently large. From the proof of 
Lemma [2.31 it is evident that for each neighbour u' of u and neighbour v' of v, there is 
path from u to v in which the second and penultimate vertices are u' and v', 
and of length at most log d _ 1 n + u(n). Hence we may also assume that K is sufficiently 
large that the probability that there is such a path for two given neighbours v! and v' 
and of length at most \og d _ l n + K is at least 1 — e. So with probability at least 1 — 2s, 
we can choose two such wu-paths P and Q, where the neighbours of u and v on P are 
both different from those on Q. In each case we may select a shortest path with these 
specifications. Then P and Q must be (K + l)-near geodesies. 
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By Claim 1, we may assume there is no vertex in common between P and Q that is 
more than cj(n)/3 from u and v. For u growing slowly enough, there is a.a.s. no point 
in common that is at most u(n)/3 from u and v either, since Lemma 12. II implies that 
a.a.s. neither u nor v is in a short cycle. Let C be the union of the paths P and Q. 
From the bounds on d(u, v), C has length at least 2 log d _ 1 n — IK. 

To prove the statement about all diametrically opposite points p and q on C, we may 
rework the argument in Lemma 13.21 The Claim proved above shows that every short 
path of the type we are interested in must use some edge not on P or Q. Arguing as 
before, we only need to eliminate the existence of A such that © holds. The same 
argument as before shows that for any fixed such p and q, with Ypq defined as before, 
we again have P(Yp, Q >l)=0((d- l)" w(n) ). 

Now apply this inequality to the 0(log d _ l n) pairs of vertices p and q diametrically 
opposite on C. Also, put 

u(n) = log^ log^ n + 7(71). 

Then the probability that Ypq > 1 for at least one of these choices of p and q is 
O ((d-l)~^). Hence, if 7(71) — > 00, we have a.a.s. for all such p and q, d(p, q) > 
f(n) — log d _ 1 log^ n — 7(71). Replacing 7 by w gives the final statement in Claim 2, 
with probability at least 1 — 2e + o(l). This statement is true for all e > 0. That 
fact implies that the final statement in Claim 2 holds a.a.s. (This can be regarded as 
"letting e — > sufficiently slowly".) Combining this with part (i) proves Claim 2, since, 
although there may be different functions at the different occurrences of u, they can be 
made the same. This completes the proof of Claim 2. To obtain the theorem, we note 
that the proof of Claim 2 does more: it shows that the two paths can be chosen to use 
distinct neighbours of u in their initial step, and distinct neighbours of v in their final 
step. □ 



4. Final remarks 

In this article we have examined the "shape" of random regular graphs. This brings 
up related questions. 

Our proof of the main theorem can be seen to give more: a.a.s. for every pair of short 
(i.e. bounded length) paths, one containing u and one containing v, there is an almost 
geodesic cycle containing both of these paths. We also show that the paths referred to 
in the theorem each contain a geodesic between the two vertices at distance k from its 
ends, for any k tending to infinity with n. 

A geodesic cycle C in G is a cycle in which for every two vertices u and v in C, 
the distance da(u,v) is equal to dc(u, v). A significant open problem is to determine 
whether in a random <i-regular graph, a.a.s. almost all pairs of vertices lie in a geodesic 
cycle. It is not even known if at least one geodesic cycle of length asymptotic to log d _ 1 n 

exists cl.ci.S. 

We may also draw conclusions on how "thin" the topological triangles are in random 
regular graphs. Consider the proof of Lemma 12.31 which analyses the time at which 
two simultaneous breadth-first reaches from u and from v join each other. The proof 
is concerned with an accurate estimate of the probability that there are no joins by a 
time near It is easy to see from the ideas in the proof that for large K, the second 
join is quite likely to occur by time io + K, and furthermore that the first two joins 
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are quite likely to be in branches that diverged, in the breadth first search from u, at 
time less than K, and similarly from v. Let v! and v' be the points of divergence near 
u and v. Then the joins give two paths P and Q from u' to v', the shorter of which, 
say P, is geodesic, and we can choose another vertex, w, on Q, of distance K from 
u', such that the resulting two subpaths of Q to u' and v' from w are both geodesic. 
Thus u', v', w form a geodesic triangle. By Lemma 13.21 (noting P and Q are 2K-neax 
geodesies from u to v), the distance between the midpoints of P and Q is a.a.s. at least 
log d _ 1 (n) — tc>(n), where a;(n) is any function tending to oo. Hence the midpoint p of P 
a.a.s. has distance at least |log d _ 1 (n) — u){n) from the union of the other two sides of 
the geodesic triangle u', v', w. The probability in the above statements tends to 1 if we 
let K — > oo sufficiently slowly. It follows that a random d- regular graph, for d > 3, is 
a.a.s. 5-hyperbolic for 5 = (log d _ 1 n)/2 — uj{n). An upper bound on such S is obviously 
half of the diameter of the graph, which is (log^ n)/2 + 0(loglogn) by the main result 
of 0. 

Finally, it would be interesting to see to what extent the geometric properties we have 
addressed in this paper are preserved if the model of regular graphs changes. Particular 
models of interest might be random Cayley graphs, random lifts of regular graphs, and 
one can consider also some deterministic models of expanders. 
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